Speaker-independent consonant recognition by integrating discriminant analysis and hmm
نویسندگان
چکیده
In this paper, we propose a new consonant recogmtIOn method which integrates two stochastic method: discriminant analysis and HMM (Hidden Markov Models). Discriminant Analysis is effective to analyze local patterns around the reference-point of a consonant such as a burst point. This method, however, is based on the assumption that the reference-point is detected precisely. HMM is able to extract the global dynamic features of a consonant from the preceding vowel to the following vowel and needs no explicit segmentation of speech. But it is hard to discriminate between similar consonants with HMM due to the quantization of input pattern vectors. Our new method constructs HMM with discriminant analysis front-end and recognizes consonants by combining the score obtained by discriminant analysis and the score by HMM. In recognition experiments of all the Japanese consonants in mono-syllables, this integrated method achieved the recognition rate of 92.1 %, which is higher by 5~ 15 % than the case using either of two methods alone.
منابع مشابه
Normalized discriminant analysis with application to a hybrid speaker-verification system
A modified linear discriminant analysis technique for speaker verification, referred to here as normalized discriminant analysis (NDA), is presented. Using this technique it is possible to design an efficient linear classifier with very limited training data and to generate normalized discriminant scores with comparable magnitudes for different classifiers. The NDA technique is applied to a cla...
متن کاملSpeaker adaptation for audio-visual speech recognition
In this paper, speaker adaptation is investigated for audiovisual automatic speech recognition (ASR) using the multistream hidden Markov model (HMM). First, audio-only and visual-only HMM parameters are adapted by combining maximum a posteriori and maximum likelihood linear regression adaptation. Subsequently, the audio-visual HMM stream exponents are adapted to better capture the reliability o...
متن کاملNeural networks for nonlinear discriminant analysis in continuous speech recognition
In this paper neural networks for Nonlinear Discrimi nant Analysis in continuous speech recognition are pre sented Multilayer Perceptrons are used to estimate a posteriori probabilities for Hidden Markov Model states which are the optimal discriminant features for the sepa ration of the HMM states The a posteriori probabilities are transformed by a principal component analysis to calcu late the...
متن کاملشبکه عصبی پیچشی با پنجرههای قابل تطبیق برای بازشناسی گفتار
Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...
متن کاملUnsupervised learning of HMM topology for text-dependent speaker verification
Usually, text-dependent speaker verification can achieve better performance than text-independent system because of the constraint that the enrollment and testing utterance share the same phonetic content. However, the enrollment data for text-dependent system usually is very limited. Expectation Maximization(EM) training of HMM will suffer from noisy estimation because of limited enrollment. A...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Systems and Computers in Japan
دوره 22 شماره
صفحات -
تاریخ انتشار 1991